Combining statistical alignment and phylogenetic footprinting to detect regulatory elements

Authors

  • Rahul Satija
  • Lior Pachter
  • Jotun Hein
Abstract

MOTIVATION: Traditional alignment-based phylogenetic footprinting approaches make predictions on the basis of a single assumed alignment. The predictions are therefore highly sensitive to alignment errors and to regions of alignment uncertainty. Statistical alignment methods, by contrast, provide a framework for performing phylogenetic analyses over a distribution of alignments.

RESULTS: We developed a novel algorithm for predicting functional elements by combining statistical alignment and phylogenetic footprinting (SAPF). SAPF performs alignment and annotation simultaneously by combining phylogenetic footprinting techniques with a hidden Markov model (HMM) transducer-based multiple alignment model, and can analyze data from multiple sequences. We assessed SAPF's predictive performance on two simulated datasets and three well-annotated cis-regulatory modules from newly sequenced Drosophila genomes. The results demonstrate that removing the traditional dependence on a single alignment can significantly improve predictive performance, especially when there is uncertainty in the alignment of functional regions.

AVAILABILITY: SAPF is freely available to download online at http://www.stats.ox.ac.uk/~satija/SAPF/
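The core idea of the abstract, scoring positions over a distribution of alignments rather than one fixed alignment, can be illustrated with a small sketch. This is not SAPF's implementation: the abstract says SAPF uses an HMM transducer-based model, whereas the code below assumes, purely for illustration, that a handful of candidate alignments have already been weighted and that each one yields a per-position probability of being functional. The function name `marginal_functional_posterior` and the input layout are hypothetical.

```python
from typing import List, Tuple


def marginal_functional_posterior(
    samples: List[Tuple[float, List[float]]]
) -> List[float]:
    """Average per-position 'functional' probabilities over alignments.

    Each sample is (weight, probs): `weight` is the unnormalized posterior
    weight of one candidate alignment (an assumed input), and `probs[i]` is
    the probability that reference position i is functional given that
    alignment. The result approximates
        P(functional at i) = sum_A P(A) * P(functional at i | A).
    """
    if not samples:
        raise ValueError("at least one alignment sample is required")
    total = sum(weight for weight, _ in samples)
    if total <= 0:
        raise ValueError("alignment weights must sum to a positive value")

    n_positions = len(samples[0][1])
    marginal = [0.0] * n_positions
    for weight, probs in samples:
        if len(probs) != n_positions:
            raise ValueError("all samples must cover the same positions")
        for i, p in enumerate(probs):
            # Weight each alignment's prediction by its normalized weight.
            marginal[i] += (weight / total) * p
    return marginal


def single_alignment_prediction(
    samples: List[Tuple[float, List[float]]]
) -> List[float]:
    """Baseline: use only the single highest-weight alignment."""
    _, probs = max(samples, key=lambda s: s[0])
    return list(probs)
```

When two candidate alignments disagree about a position, the single-alignment baseline reports only the prediction of the higher-weight alignment, while the marginal keeps both predictions in proportion to their weights. That is the sensitivity to alignment uncertainty the abstract describes.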

Related resources

Identification of Regulatory Elements Using Comparative Genomics and Phylogenetic Footprinting

With the complete compilation of several genomic sequences, understanding the regulation of gene activity has become one of the primary goals for the molecular biology community. This research has the objective of identifying regulatory elements of human genes using phylogenetic footprinting. All the biological data used comes from NCBI databases (HomoloGene, EntrezGene, EntrezNucleotide), diffe...

Identifying cis-regulatory elements by statistical analysis and phylogenetic footprinting and analyzing their coexistence and related gene ontology

Shi W, Zhou W, Xu D. Identifying cis-regulatory elements by statistical analysis and phylogenetic footprinting and analyzing their coexistence and related gene ontology. Physiol Genomics 31: 374–384, 2007. First published September 11, 2007; doi:10.1152/physiolgenomics.00085.2006.—Discovery of cis-regulatory elements in gene promoters is a highly challenging research issue in computational mole...

CONREAL: conserved regulatory elements anchored alignment algorithm for identification of transcription factor binding sites by phylogenetic footprinting.

Prediction of transcription-factor target sites in promoters remains difficult due to the short length and degeneracy of the target sequences. Although the use of orthologous sequences and phylogenetic footprinting approaches may help in the recognition of conserved and potentially functional sequences, correct alignment of the short transcription-factor binding sites can be problematic for est...

A taxonomy-traversing approach to discover cis-acting elements in prokaryotes

The increasing number of sequenced genomes opens promising avenues to apply comparative genomics in order to detect phylogenetically conserved cis-acting elements, and to study their divergence across taxonomy. This approach, called phylogenetic footprinting, is based on the hypothesis that, due to selective pressure, regulatory elements tend to evolve at a slower rate than surrounding non-codin...

Whole Genome Human/Mouse Phylogenetic Footprinting of Potential Transcription Regulatory Signals

Phylogenetic footprinting is an efficient approach for revealing potential transcription factor binding sites in promoter sequences. The idea is based on the assumption that functional sites in promoters should evolve much more slowly than other regions that do not bear any conserved function. Therefore, potential transcription factor (TF) binding sites that are found in the evolutional...

Journal:
  • Bioinformatics

Volume 24, Issue 10

Pages: -

Publication date: 2008